Understanding Operational Risk Capital Approximations: First and Second Orders 

Gareth W. Peters 1 ' 2 ' 3 Rodrigo S. Targino 1 Pavel V. Shevchenko 2 



1 Department of Statistical Science, University College London; 
email: gareth.peters@ucl.ac.uk 
(Corresponding Author) 
CSIRO Mathematics, Informatics and Statistics, Sydney, Australia 
' 5 Associate Fellow, Oxford-Man Institute, Oxford University 



Preprint submitted to Elsevier 



March 13, 2013 



Understanding Operational Risk Capital Approximations: First and Second Orders 

Gareth W. Peters 1,2 ' 3 Rodrigo S. Targino 1 Pavel V. Shevchenko 2 



Abstract 

We set the context for capital approximation within the framework of the Basel II / III regulatory capital accords. This is 
particularly topical as the Basel III accord is shortly due to take effect. In this regard, we provide a summary of the role 
of capital adequacy in the new accord, highlighting along the way the significant loss events that have been attributed to 
the Operational Risk class that was introduced in the Basel II and III accords. Then we provide a semi-tutorial discussion 
on the modelling aspects of capital estimation under a Loss Distributional Approach (LDA). Our emphasis is to focuss 
on the important loss processes with regard to those that contribute most to capital, the so called "high consequence, 
low frequency" loss processes. 

This leads us to provide a tutorial overview of heavy tailed loss process modelling in OpRisk under Basel III, with discussion 
on the implications of such tail assumptions for the severity model in an LDA structure. This provides practitioners with 
a clear understanding of the features that they may wish to consider when developing OpRisk severity models in practice. 
From this discussion on heavy tailed severity models, we then develop an understanding of the impact such models have 
on the right tail asymptotics of the compound loss process and we provide detailed presentation of what are known as 
first and second order tail approximations for the resulting heavy tailed loss process. From this we develop a tutorial on 
three key families of risk measures and their equivalent second order asymptotic approximations: Value-at-Risk (Basel III 
industry standard); Expected Shortfall (ES) and the Spectral Risk Measure. These then form the capital approximations. 

We then provide a few example case studies to illustrate the accuracy of these asymptotic captial approximations, the 
rate of the convergence of the assymptotic result as a function of the LDA frequency and severity model parameters, the 
sensitivity of the capital approximation to the model parameters and the sensitivity to model miss-specification. 

Key words: Basel II /III; Capital Approximation; Loss Distributional Approach; Capital Approximation; Value-at-Risk; 
Expected Shortfall; Spectral Risk Measure; Subexponential; Regularly Varying. 



1. The Changing Landscape of Capital Accords 

In juristictions in which active regulation is applied to the banking sector throughout the world, the modelling of Opera- 
tional Risk (OpRisk) has progressively taken a prominent place in financial quantitative measurement. This has occurred 
as a result of Basel II and now Basel III regulatory requirements. For example in the context of banking regulation in 
Australia, the basic framework of Basel II /III is summarized in Figure [I] In this juristiction one observes that a large 
amount of the developments in quantitative methodology for estimation of OpRisk capital, development of OpRisk frame- 
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works embedded within retail banks and large financial institutions as well as infrastructure for collection and reporting 
of losses in data bases has been achieved largely due to regulatory choices to link Advanced Measurement Approaches 
in OpRisk modelling to Credit Risk. In other areas the progression of such features has lagged behind the Australian 
example but now OpRisk in other large banking sectors is becomming increasingly prominent. 

There has been a significant amount of research dedicated to understanding the features of Basel II, see for example 
PQ, [5] and [Hj- In addition the mathematical and statistical properties of the key risk processes that comprise OpRisk, 
especially those that contribute significantly to the capital charge required to be held against OpRisk losses have also 
been carefully studied, see for example the book length discussions in [4], [5] and [6]. 

There have been both numerical and simulation based approaches adopted as well as analytical mathematical developments 
of closed form approximations for capital approximation. In this manuscript we aim to provide a clear and concise 
understanding of several increasingly popular approaches to capital approximation from an analytic perspective. In the 
process we demonstrate several important key details of such capital approximations, we discuss the implications of use 
of such approximations, their shortcommings and we assess their behaviours in simple but realistic loss models adopted 
in practice. 

In particular we provide a detailed discussion on the attributes of the so called capital approximation methods known 
as Single Loss type approaches of first and second order expansions. In doing so, this manuscript draws together several 
disperate areas of the literature to allow industry professionals an insight into the processes that have lead to expansions 
of tail functionals for risk measures such as Value-at-Risk, Expected-Shortfall and Spectral Risk measures, considered for 
capital definitions. 

In understanding the context for these developments we discuss first the regulatory evolution of OpRisk modelling frame- 
works. In January 2001 the Basel Committee on Banking Supervision proposed the Basel II Accord ([7], [8], [9]) which 
replaced the 1988 Capital Accord. Now in 2013 the Basel III Accord is due to start to be considered. Since the initiation 
of the Basel captial accords, the discipline of OpRisk and its quantification have grown in prominence in the financial 
sector. Paralleling these developments have been similar regulatory requirements for the insurance industry which are 
referred to as Solvency 2. 

Under the Basel II/III structures there is at the core the notion of three pillars, which, by their very nature, emphasize 
the importance of assessing, modelling and understanding OpRisk loss profiles. These 3 pillars are; minimum capital 
requirements (refining and enhancing riskmodelling frameworks), supervisory review of an institutions capital adequacy 
and internal assessment processes and market discipline, which deals with disclosure of information, see Figure [l] 

In the third update to the Basel Accords due for implementation in the period 2013-2018, a global regulatory standard 
which draws together bank capital adequacy, stress testing and market liquidity is created. It is established as an 
international best practice for modelling OpRisk by the members of the Basel Committee on Banking Supervision, see 
[TU] and discussions in [IT] . 

The Basel III accord naturally extends on the work developed in both the Basel I and Basel II accords with the new 
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accord arising primarily as a response to the identified issues associated with financial regulation that arose during the 
recent global financial crisis in the late-2000s. In this regard, the Basel III accord builds on Basel II by strengthening the 
bank capital requirements as well as introducing additional regulatory requirements on bank liquidity and leverage. 

Banking regulation under Basel II and Basel III specifies that banks are required to hold adequate capital against OpRisk 
losses. OpRisk is a relatively new category of risk which is additional to more well established risk areas such as market 
and credit risks. As such in its own right OpRisk attracts a capital charge which is defined by Basel II/III [1, p. 144] as: 
"/■ • • / the risk of loss resulting from inadequate or failed internal processes, people and systems or from external events. 
This definition includes legal risk, but excludes strategic and reputational risk." OpRisk is significant in many financial 
institutions. 

Before detailing the changes to capital requirements due to come into industry practice under Basel III, it is prudent to 
recall the Basel definition of Tier I capital, which is the key measure of a bank's financial strength from the perspective 
of the regulatory authority. In particular the capital accord in Basell II and III states that financial institutions must 
provide capital above the minimum required amount, known as the floor capital. In addition this capital as specified 
in regulation is comprised of three key components, Tier I, Tier II and Tier III. Both Tier I and Tier II capital were 
first defined in the Basel I capital accord and remained substantially the same in the replacement Basel II and Basel III 
accords. 

Definition 1.1 (Tier I Capital). The Tier I capital under regulation is comprised of the following main components: 

1. Paid-up share capital / common stock; 

2. Disclosed Reserves (or retained earnings). 

It may also include non-redeemable non-cumulative preferred stock. 

The Basel Committee also noted the existance of banking strategies to develop instruments in order to generate Tier I 
capital. As a consequence, these must be carefully regulated through imposition of stringent conditions, with a limit to 
such instruments at a maximum of 15% of total Tier I capital. 

Definition 1.2 (Tier II Capital). The Tier II capital under regulation is comprised of the following main components: 

1. Undisclosed reserves; 

2. Asset revaluation reserves; 

3. General provisions / general loan-loss reserves; 

4. Hybrid (debt/equity) capital instruments; and 

5. Long-term subordinated debt. 

In this regard one may consider Tier II capital as representing so called "supplementary capital". 

We note at this stage that as a consequence of different legal systems in each juristiction, the accord has had to be 
sufficiently flexible to allow for some interpretation of specific capital componets within the context of each regulators 
juristiction. Depending on the particular juristiction in question, the specific country's banking regulator has some 
discretionary control over how exactly differing financial instruments may count in a capital calculations. 
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Remark 1.3. The key reason that Basel III requires financial institutions to hold capital is that it is aimed to provide 
protection against unexpected losses. This is different to mitigation of expected losses, which are covered by provisions, 
reserves and current year profits. 

We note that modifications under the Basel III accord relative to its predecessor refer to limitations on Risk Weighted 
Capital (RWC) and the Tier I Capital Ratio, as defined below. 

Definition 1.4 (Risk Weighted Assets). These assets comprise the total of all assets held by the bank weighted by 
credit risk according to a formula determined by either the juristictions regulatory authority or in some cases the central 
bank. Most regulators and central banks adhere to the definitions specified by the Basel Committee on Banking Supervision 
(BCBS) guidelines in setting formulae for asset risk weights. Liquid assets such as cash and coins typically have zero risk 
weight, while certain loans have a risk weight at 100% of their face value. As specified by the BCBS the total RWA is not 
limited to Credit Risk. It contains components for Market Risk (typically based on value at risk (VAR) ) and Operational 
Risk. The BCBS rules for calculation of the components of total RWA have also been updated as a result of the recent 
financial crisis. 

Definition 1.5 (Tier I Capital Ratio). The Tier 1 capital ratio is the ratio of a bank's core equity capital to its total 
risk-weighted assets (RWA). 

Next we highlight the prominent extensions to the Basel II accord, established in the Basel III accord. In particular the 
Basel III accord will require financial institutions to hold for risk weighted assets, 4.5% of common equity which is an 
increase from the previous 2% under Basel II as well as 6% of Tier I capital itself an increase by 2% relative to Basel II. 
In addition to these changes to common equity and Tier I capital, Basel III also introduces a minimum leverage ratio and 
two additional required liquidity ratio limits. Finally, of the significant changes, there are also additional capital buffers 
introduced: 

1. Introduction of a mandatory capital conservation buffer of 2.5% ; and 

2. A discretionary countercyclical buffer, allowing national regulators to require up to another 2.5% of capital during 
periods of high credit growth. 

1.1. Understanding the Significance of OpRisk Losses 

To illustrate the significance of OpRisk losses on the stability of banking operations and the extent that particular loss 
processes in this class of risk can threaten financial insolvency for banks we note some aggregate loss figures and illustrate 
how such aggregates are possible by noting particular instances of prominent loss events that have occurred under the 
OpRisk class. 

It was reported in [TJ] that the total loss associated with operational risk has reached as high as US$ 96 billion in the 
United States during the financial crisis in 2008. There have also been numerous OpRisk loss events that have been 
highlighted in the media to support such enormous aggregate figures. Such single event examples of extremely large 
OpRisk losses include: Barings Bank (loss GBP 1.3 billion in 1995), Sumitomo Corporation (loss USD 2.6 billion in 
1996), Enron (USD 2.2 billion in 2001), and recent loss in Society Generale (Euro 4.9 billion in 2008). 

Some of the lesser reported cases have recently come to light with the paper of [13] who paint similar pictures in the 
Chinese banking sectors as have been observed in US and European markets. For example they state that typical 
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Figure 1: The Basel II and now Basel III pillars for capital adequacy assessment. 

examples of large OpRisk loss events in recent years in the Chinese banking sector include the Guangdong branch of the 
Industrial and Commercial Bank of China (ICBC) which in 2003 lost 740 million yuan; the Jinzhou branch of the Bank 
of Communications in 2004 which lost 22.1 million yuan; the Heilongjiang branch of the Bank of China (BOC) in 2005 
which lost 100 million yuan; the Guangdong branch of BOC in 2006 lost 400 million yuan; and the Qilu Bank in 2010 
which lost 100 million yuan. 

Each of these single loss events are significant and indicate the importance of models for loss processes which will capture 
such extreme loss events adequately when undertaking capital estimation. These illustrative examples and many more, 
has provided a clear focus for practitioners and risk modellers to invest in efforts to understand heavy tailed loss modelling, 
which has been highlighted in numerous reviews on OpRisk modelling, see [14] , [15] , [16] , [17] , [18] and [19] . In particular 
we will focus on the most widely used model framework involving a compound process reprsentation of the risk process. 

1.2. A Brief Background on Loss Distributional Approach (LDA) Models 

To quantify the operational risk capital charge under the current regulatory framework for banking supervision, referred 
to as Basel II/ Basel III, many banks adopt the Loss Distribution Approach (LDA). There are several modeling issues 
that should be resolved to use this approach in practice, a detailed review on the quantitative properties of estimation can 
be found in [115], [2D], [2T], [22], [23], [23], [23] and in addition in some important heavy tailed settings (large consequence, 
rare occurance) closed form representations of such models in [25], [TDJ and [15] . 

In this section we fist motivate and introduce the context of LDA modeling in risk and insurance. Next, we provide a 
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brief specifically selected survey of closed form analytic results known in the actuarial and risk literature for sub-classes 
of such LDA models as the Single Loss Approximations (hereafter SLA). As pointed out previously it is precisely these 
heavy tailed loss processes that result in the significant individual loss events, as discussed above. We therefore then 
focus on key elements of heavy tailed loss process asymptotics with a view to understanding capital approximations. In 
doing so we draw together several disparate sources of information for practitioners from sources in both mathematics 
and risk literature, for example we consider results recently developed in actuarial literature for the heavy tailed case 
corresponding to the first order and second order asymptotic approximations, see comprehensive discussions in a general 
context in [55], [27J and the books, [35J and the forthcoming [25] . 

We conclude this section by observing that according to regulatory standards and indeed good risk management prac- 
tice such asymptotic SLA approximations are often required to be accompanied with numerical and statistical solu- 
tions which can more readily take into account model uncertainty, parameter uncertainty, parameter sensitivity and 
asymptotic rate of convergence analysis, see discussions in |30j . 

The fact that such approximations are inherently asymptotic in nature, and may be inaccurate outside of the neighborhood 
of infinity, means such analysis is directly relevant in practice when considering the suitability of such approximations for 
capital calculations. It is important that these be both accurate and relatively stable over time. However, incorporating 
these features into a SLA is often highly challenging. 

To begin, consider the widely utilized insurance model known as a single risk LDA model. This represents the standard 
under the Basel II /III capital accords [3T] and involves an annual loss in a risk cell (business line/event type) modeled as 
a compound distributed random variable, 



for t = 1, 2, . . . , T discrete time (in annual units) and index j identifies the risk cell. Furthermore, the annual number of 
losses is denoted by which is a random variable distributed according to a frequency counting distribution P^'(-), 
typically Poisson, Binomial or Negative Binomial. The severities (losses) in year t are represented by random variables 
Xa*\t), s > 1, distributed according to a severity distribution F^\-) and there are N$ j) of them in year t. 

Before proceeding, it will be relevant to define some basic notation adopted throughout. In general, we will suppress the 
risk cell index j and time index t unless explicitly utilised. Therefore, we denote by F(x) a distribution with positive 
support for the severity model characterizing the loss distribution for each random variable X s for s€ {1,2,..., N}. We 
denote the annual loss (aggregated loss) by Z with annual loss distribution G = Fz and the partial sum of n random 
losses by S n with distribution Fs n — F n * where F n * denotes the n-fold convolution of the severity distribution for the 
independent losses. The densities, when they exist, for the severity distribution and annual loss distributions will be 
denoted by f{x) and fz(x) respectively. 

In constructing the LDA model we assume that all losses are i.i.d. with (t) ~ F(x) and that the severity distribution 
is continuous with no atoms in the support [0, oo). As a consequence, linear combinations (aggregation) of losses in a 




(1.1) 
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given year, denoted by the partial sum 

n 
s=l 

have the following analytic representation: 

F Sn (x) = (F*F*---F)(x)= [ F {n - 1) *{x~y)dF{x). 

«/[0,oo) 

In [32] it is shown that if F(x) has no atoms in [0, oo) then the n-fold convolution of such severity distributions will also 
admit no atoms in the support [0, oo). In addition we note that continuity and boundedness of a severity distribution F (x) 
is preserved under n-fold convolution. Hence, if F(x) admits a density 4zF(x) then so does the distribution of the partial 
sum Fg n , for any n£ {1,2,...} and compound process (random sum) Fz- For most models such analytic representations 
of the combined loss distribution are non closed form, with the exception of special sub-families of infinitely divisible 
severity distribution models, see [IB] . 



2. On the Road to Capital Approximation: a Tale of Tails 



In this section we present an overview of important technical results from the probability and mathematical statistics 
literature that will lead to an understanding of OpRisk capital approximation techniques that are being discussed in 
the OpRisk literature recently, see [33], [34], [35], [36] and [27]. Since this section is aimed at a guided review we make 
explicit some important definitions that are used throughout, for example we remind the reader of the notion of asymptotic 
equivalence and max-sum equivalence, as well as several key definitions relating to tail asymptotics for a severity model 
under an LDA framework that are not widely known by practitioners utilising such approximations. 

Definition 2.1 (Asymptotic Equivalence). A probability distribution function F(x) is asymptotically equivalent to 
another probability distribution function G(x), denoted by F(x) ~ G{x) as x — > oo if it holds that, Ve > 0, 3x$ such that 
Vx > Xq the following is true 

F(x) 



G(x) 



- 1 



< 6. (2.1) 



Definition 2.2 (Max-Sum Equivalence). A probability distribution function is max- sum- equivalent, denoted by F 

G, when the convolution of the tail distribution of two random variables is distributed according to the sum of the two tail 

distributions asymptotically, 

l-(F* G)(x) = (FTG)(x) ~ F(x) + G(x), x ->■ oo, 

see discussion in \ST^ . 

From these basic definitions, we next proceed to consider some key classifications of heavy tailed distributions. Though 
technical in nature, these will allow us to carefully understand the behaviour of both heavy tailed severity distributions 
and the compound processes constructed with these models under an LDA framework. 



2.1. Review of Classifications for Heavy Tailed Severity Distributions 

In practice the choice of severity distribution F(x) should be considered carefully for each individual risk process as 
it can have a significant impact on the capital and the choice of appropriate captial approximation method. This is 
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especially the case for those risk processes for which business managers believe there will be infrequent losses with very 
high consequence. It is therefore important to carefully consider the possible implications on capital calculation and 
capital approximation that arise when making particular assumptions about the severity distributions right tails. In this 
regard we begin with a basic coverage of the key features one may consider when deciding on a suitable heavy tailed 
severity distribution, with respect to the behaviour of the right tail, associated with probabilities of large loss amounts. 
We note that the classifications of severity models into particular families of heavy-taile models as discussed below is by 
no means supposed to be disjunctive in the understanding of such groupings. In addition we also point out that fact that 
we have several choices to consider when developing such high-consequence severity model assumptions. For instance we 
could assume properties of the right tail of the distribution function, the density function or the survival function. 

For large consequence events, one may often consider distributions for which the moment generating function doesn't 
exist on the positive real line such that 

r e tx dF(x) =00, Vi>0. (2.2) 

In otherwords, the standard Markov Inequality for the expontially decaying "light" tail behaviour of a loss distribution 
in which 

F(x) < cxp(-sx)E [exp(sx)] , Vx > (2.3) 

does not apply. 

It can be shown that a distribution F has a moment generating function in some right neighbourhood of the origin if and 
only if the following bound holds for some positive real numbers M and t, 

F(x) < Mexp(-tx), Vx > 0. (2.4) 

Hence, one basic definition of an important class of distributions in OpRisk is the heavy tailed distributions which have a 
right tail heavier than any exponential distribution. However, there are numerous more refined categorisations of heavy 
tailed distributions which are required for the results in this manuscript. 

A popular class of heavy-tailed models is the sub-exponential family of severity distributions that we denote by membership 
(F(x) £ F) and define below, see discussion in for example [38] and [39] . 

Definition 2.3 (Sub-exponential Severity Models). The sub- exponential family of distributions F defines a class of 
heavy tailed severity models that satisfy the limits 

lim — }\ - — n, (2.5) 

x^ao 1 - F(x) ' V ' 

if and only if, 

1-F 2 *(x) 

lim \ VV = 2 ' (2 - 6) 

k->oo 1 — t (x) 

In |40j it was demonstrated that the necessary and sufficient condition for membership of a severity distribution in the 
sub-exponential class (F £ F) is satisfied if and only if the tail distribution F(x) = 1 — F(x) satisfies 



lim f X ^yl F (y)d y = l. 
x-hx>J F(x) 
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Alternatively, one may characterize the family of distributions F £ T by those that satisfy asymptotically the tail ratio 

lim F{ *- y) =1, Vye[0,oo). (2.7) 

Severity models F g F are of interest for severity distributions in high consequence loss modeling since they include 
models with infinite mean loss and infinite variance. In addition, the class J- includes all severity models in which the 
tail distribution under the log transformed r.v., F(\og(x)), is a slowly varying function of x at infinity. We will discuss 
both regular and slow variation below. 

Examples of models in this family include: 

1. Pareto: F(x) = f^p^y) for x > 0, c > and a > 

2. Log Normal: F(x) = \ + |erf ^j=- , x > 0, p £ M and a > 0. 

3. Heavy-Tailed Weibull: F(x) = exp ( — Ax Q ), x > 0, A > and < a < 1. 

We may also consider classes of heavy-tailed severity distributions as classified by their right tail properties through formal 
definitions such as regularly varying tail, long-tailed, dominantly varying tail, subversively varying tail and smoothly- 
varying tail, each of which we briefly define below. We then relate these different classes of severity model assumptions 
to each other to provide a basic understanding of the relationships between each of these possible heavy tailed severity 
modelling assumption. 

Arguably one of the most utilised sub-classes of the sub-exponential distributions is the class of regularly varying dis- 



tributions. Now recalling the definition of the class of regularly varying functions given by Definition 2.4 see |41j and 
[38]. 

Definition 2.4 (Regular Variation). A measurable function f(x) > that satisfies the condition that 

lim 4t^t -> A p , VA > (2.8) 

x^oo f(x) 

is regularly varying with index p denoted by f £ RV p . 



We define the class of all regularly varying functions denoted by 1Z — \J p gs,RV p . It is also convention to distinguish the 
special sub-class of functions denoted generically by L(x) that are regularly varying with an index of p = as follows. 

Corollary 2.5 (Slowly Varying Tail). A function f £ 1Z is slowly varying if p = 0. 

So one may consider the class of severity distributions or densities that are members of this class. To provide some intuition 
for properties of severity models in this class of regularly varying models we note the following important features. It can 
be shown that for loss processes in which the severity models have a strictly non-negative support, the membership of a 
severity distribution in the class of regularly varying functions with tail index p > implies that 

fc. for some real constant c < oo, if 8 < p; 
(2.9) 
oo, if 8 > p. 
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Analogously (setting /3 = p + k) if the right tail of a distribution F is regularly varying with an index p then this implies 
that the distribution F will have (p + fc)-th moments which are infinite for k > 0. In addition the following is also known 
about truncated moments of severity distributions in which Fx{x) is regularly varying at infinity with p = —a for some 
a > 0. 



E[X?I X<X ]~ - ^—xPFxix), 13 > a. 

" (2-10) 

E [Xn x>x ] ~ x?F x {x), f3<a. 

1 J p — a 

Clearly, these truncated moments are often of direct interest in risk management and especially in the context of capital 
estimation. 

In addition, following properties of regularly varying functions apply specifically in the context of distributions and 
densities, see [32J Theorem 1.20] and [15] . 

Corollary 2.6 (Regularly Varying Severity Distributions). If the severity model has a regrularly varying distribu- 
tion F £ RV P with a density f which is locally integrable on [1, oo) with 

F(x) = f f(t)dt, (2.11) 

then given the severity density f is ultimately monotone, one has for p =/= that f G RV p -\. Furthermore, one can show 
that in the case of non-negative random variables, such as for a loss process in an LDA severity model, if the distribution 
is regularly varying F G RV p with p>0, then the right tail F(x) € RV_ a . 

Furthermore, in OpRisk when considering severity densities and distribution functions we are working with strictly 
positive functions. In this regard we note that positive regularly varying functions have a unique representation detailed 



in Theorem 2.7 see [44], [45] . [46] and [47] . This representation demonstrates an important property of such positive 
regularly varying functions, as it shows that the integration of regularly varying functions (tail functionals) will behave 
in the same manner as the integration of power functions. In addition, we note that in general the class of severity 
distributions and densities considered in OpRisk settings will be strictly monotonic in their tail behaviour. This is 
significant as it means that one will achieve uniform convergence in the limit taken in the definition of regular variation 
for such severity distributions. 

Theorem 2.7 (Karamata's Representation Theorem for Regularly Varying Functions). A function f is a pos- 
itive regularly varying function at infinity, f £ RV- P , with index —p if and only if f can be represented by 

f(x) = c(x) exp ( f ~ P + e ® dt) ,Vx>x (2.12) 



x 



with c{x) = c + o(l) for some c > and e(t) = o(l). 



Remark 2.8. It is worth considering the intuition and relevance of the Karamata Representation Theorem. In particular 
it demonstrates that when integrating regularly varying functions f € RV P , one can pass the slowly varying component 
outside the integral as follows 

F(x)= [ f(t)dt= [ t p Lit)dt = L(x) [ t p dt=^^-, (2.13) 
Jo Jo Jo P+ l 

see discussion in J3R p. 25]. 
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Remark 2.9. A consequence of this representation theorem is that every regularly varying function f € RV P will admit 
a representation given by, 

f{x)=x"L{x). (2.14) 

Remark 2.10. One way to understand the implications of working with severity models from this class of distributions is 
to consider what it tells the practitioner about integration of functions with respect to such severity distributions. Of course 
the key component in this regard will be the influence the right tail plays in such functionals. In particular if p > —1 and 
the severity density f £ RV P then the integration of such a function satisfies 



1-F(x)= / f{t)dt e RV p+1 . (2.15) 
Jo 

Furthermore, if the index of regular variation is p < — 1 then one has 

= I /(*)* G RV p+ i. (2.16) 



n 



In addition a distribution F which is regularly varying with index p can be characterised by what is widely known as the 



tail balance condition in Definition 2.11 sec 148 . 



Lemma 2.11 (Tail Balance Condition). A distribution function F is regularly varying with index p>0 if there exists 
p, q > with p + q = 1 and a slowly varying function L{x), which for all A > means 

Urn ^ = 1, (2.17) 

and the following tail balance conditions are satisfied as x — > oo 

F(-x) = qx-"L(x) (l+o(l)) 
F(x) =px- p L(x)(l + o(l)). 

Finally, it is important in the context of this manuscript which is considering capital approximations to make the connec- 
tion between the properties of the regularly varying distribution and its inverse (quantile function) that is often utilised 
pointwise as the mathematical measure of OpRisk capital, such as in the Basel II /III stipulated Value-at-Risk for some 
quantile level a. 

Lemma 2.12 (Regularly Varying Distribution and Quantile Functions). If the right tail of a distribution is reg- 
ularly varying at infinity such that F G RV- P with p > and F(x) = x~ p L(x), then it is also true that the quantile function 
F*~(t) = inf {x : F(x) > t} will be regularly varying at the origin, F^(t) G RV- p . If one defines Q(t) := F*~ (l — x) on 

[l,oo) then this leads to the representation Q(t) = t~p L" (t*j> where 1$ represents the De Bruyn conjugate of L~p, see 

details in \49\ p. 79]. 

Typical examples of slowly varying functions are functions converging to a positive constant, logarithms and iterated 
logarithms. Another important note to make here is that distributions such as the Pareto, Cauchy, Student-t, Burr and 
log-gamma, truncated a-stable distributions have regularly varying tails and are ultimately infinitely differentiable and 
their derivatives are regularly varying. 

One can also consider a related sub-classes of regularly varying severity models known as the class of Smoothly Varying 



functions given in Definition 2.13 see [SD] and [551 P-6]. 
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Definition 2.13 (Smoothly Varying Function). A real measurable function f £ SR- P (m) is smoothly varying with 
index —p and order m if it is eventually (asymptotically) m-times continously differentiable and the m-th derivative 
D m f(x) = f( m '(x) is regularly varying with index /( m ) £ RV- p — m . Furthermore, for any non-integer value u > 0, with 
u = to + r for to € N and r £ [0, 1), then a function f(x) is smoothly varying with index —p and order u if f £ SR- P ^ m 
and the following limit holds 

limlimsup sup f — 7 ^ = 0. (2.19) 

One can also state the smooth variation theorem which provides asymptotic bounds on the severity density, see [41]. 

Theorem 2.14 (Smooth Variation Theorem). If a severity density model has a regularly varying right tail f £ RV P 
then there exists functions fi and fi with fi £ SR P and asymptotic equivalence f\ ~ fi such that in some neighbourhood 
of infinity the following bounds apply f\ < f < fi . 

Corollary 2.15. If the severity density has a regularly varying right tail, f £ RV P , then there exists a function g £ SR P 
with g r~j f. 

This further characterization of a sub-class of regularly varying severity models is highly relevant as it allows one to make 
some comments on integrals of tail functionals with respect to such severity distributions. In particular, one can show 
the following is true for two smoothly varying functions when utilised to construct a product function, see [511 p. 47]. 

Lemma 2.16 (Products of Smoothly Varying Functions). Given two smoothly varying functions f £ SR p and 
4> £ SR a , then the product of these two functions is also smoothly varying as f(x)cj>(x) £ SR p + a . 

Remark 2.17 (Implications for Capital Approximations). This product closure for the family of smoothly varying 
functions, when combined with convolution closure properties of regularly varying functions to be explained below is 
particularly useful when integrating tail functionals of compound processes, such as would be required when calculating 
capital approximations under certain risk measures such as Expected shortfall or Spectral Risk Measures. 

Building on these sub-classes of regularly varying functions, one can also define additional notions or tail variation such 
as dominantly varying, subversively varying and long tailed severity models, see discussion in [5 2) 

Definition 2.18 (Dominantly Varying Tail). A severity distribution function F is said to have a dominantly varying 
tail if it satisfies the asymptotic condition that 

F(ux) 

limsup _ / < oo, (2.20) 

x-»oo F(x) 

for any u £ (0, 1). An alternative equivalent relationship is to consider 



Flux) 

liminf^ — - > 0, (2.21) 

x^oo F(x) 



for any u > 1. 



This notion of dominated variation is interesting to consider for the following reasons discussed in |53) . It is well known 
that for a severity model with a positive support, if the distribution F has a tail F which is regularly varying, then in 
this case it will imply that F is in the family of subexponential distributions. Alternatively, if the tail of the severity 
distribution F is instead merely of dominated variation, then it is no longer the case that the severity distribution F 
needs to be in the family of subexponential models. 
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Definition 2.19 (Subversively Varying Tail). A severity distribution function F is said to have a subversively vary- 
ing tail if it satisfies the asymptotic condition that 



l imsup ^)<l ! (2.22) 

x^oo t (x) 



for any u > 1. 



Definition 2.20 (Long Tailed Distribution). A severity distribution function F is said to be long-tailed if it satisfies 
the asymptotic condition that 

lim = I, (2.23) 

Z-S-OO F(X) 

for all constants i/gR. Equivalently one can state this by saying that F(x — y) is asymptotically equivalent to F(x), that 
is F(x — y) ~ F[x). 

The long tailed severity model case is particular interesting as members of this family have the property that the distri- 
bution of a random variable X ~ F is said to have a long right tail if for all A > 0, 

lim Pr [X > x + X\X > x] = I, (2.24) 

x— >oo 

or equivalently put in terms of asymptotic order it means that 

F(x + \) ~ F(x). (2.25) 

Therefore, the interpretation of a right long-tailed distributed quantity is that if the long-tailed quantity exceeds some 
high level, the probability approaches 1 that it will exceed any other higher level. Put simply if you know the loss amounts 
are significant then the realized losses from such a severity model are probably worse than you think. 

Remark 2.21. One can show that all long-tailed distributions are heavy-tailed, but the converse is false. In addition one 
has that all subexponential distributions are long-tailed, but examples can be constructed of long-tailed distributions that 
are not subexponential. 

Having defined these different families of tail behaviour in a severity distribution or density, we note the following 
relationships between these families, see discussion in (54] . 

Remark 2.22 (Relating the Families of Severity Models by Tail Behaviour). The following relationships between 
the different families of severity distributions, classified by their right tail behaviour holds: 

1. Firstly, the class of sub- exponential distributions is larger than the class of regularly varying distributions and one 



can observe the relationship through the result in Lemma 2.28, see H8\ Lemma 3.2]. Secondly, it is well known that 



the class of smoothly varying functions, functions with continous derivatives being regularly varying at infinity, is a 
subclass of regularly varying functions. 
2. The intersection between the family of dominantly varying, subversively varying and sub- exponential tailed distribu- 
tions is contained in the family comprised of the intersection between the dominantly varying tailed functions and 
the long tailed functions. Furthermore, these sub-families formed from the intersections are themselves contained in 
the family of sub- exponential models which is iteself contained in the family of long tailed distributions. 

It will be useful in generalizing assumptions on the frequency distribution in the LDA structure when obtaining the results 
for the single loss capital approximations to also consider two additional classification concepts for the families of heavy 
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tailed severity models, these are the notions of extended regularly varying functions (ER) and the O-regularly varying 
functions (OR) given in Definition 2.23 



First we define the following properties for the severity density, using the notation |41) . the lim sup and lin inf according 



to Equation 2.26 and Equation 2.27 for A > 



and 



/* (A)=hmsup^ (2.26) 



U(X)=]imM^- (2.27) 
x^oo j(x) 



with the relationship that /* (A) = -prjj 



Definition 2.23 (Extended and O-Type Regular Variation). The class of extended regularly varying functions is 
the set of postive measurable functions f € ER satisfying for some constants c, d the relationship 

A d < /.(A) < /* (A) < A c , VA > 1. (2.28) 

The class of O-regularly varying functions is the set of postive measure functions f G OR satisfying 

< /*(A) < /*(A) < oo, VA > 1. (2.29) 

It will also be beneficial to recall the definition of the Matuszewska index, see [55] and [HJ page 68] . 

Definition 2.24 (Matuszewska Index). Let f be a positive density function, then the upper Matuszewska index, de- 
noted a(f) is given as x — > oo by the infimum of the a values such that there exists a constant C — C(a) where for each 
A > 1 one has 

4^ < C{l + o(l)}A Q uniformly in A e [1, A]. (2.30) 
f( x ) 

The lower Matuszewska index, denoted j3(f) is analogously given as x — > oo by the supremum of /3 values for which for 
some constant D > and for all A > 1 one has 

>D U + o(l)} A* 3 uniformly in A e [1, A]. (2.31) 

The following relationship between Matuszewske indexs is known for positive functions f 

M = ~ a (?) ■ (2 ' 32) 



One can show the following properties of the Matuszewska Index for a postive function / given in Lemma 2.25 see [4H 
page 71]. 

Lemma 2.25. Consider a positive function f then the following properties w.r.t. the Matuszewska Index's sign can be 
shown: 

1. / has bounded increase f G BI if a(f) < oo.; 

2. / has bounded decrease f € BD if P(f) > — oo.; 

3. / has positive increase f €E PI if /?(/) > 0.; and 

4. / has positive decrease f € PD if a(f) < 0. 
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Of direct relevance to the results to be discussed in this manuscript on higher order asymptotic tail expansions will be 
the extension of the concept of Matuszewska indices which was further developed for distribution function tails in |56j 
who provided the statements in Lemma |2.26| 

Lemma 2.26 (Matuszewska Indices for Distribution Functions). Given a severity distribution F, then the upper 
Matuszewska index for the tail of the distribution (F(x) = 1 — F(x)) denoted by 7-p is given by 

7F = inf {J°f^) :M>1 } = _ lim l0 f* H (2-33) 

( lOgU J «->oo log It 

The lower Matuszewska index is given for the tail of a distribution analogously by 

[ \oeF*(u) 1 , logF*(u) 
&w = sup <^ 2 \_L ■ u > 1 K = _ n m (2.34) 

log U u-s-oo log U 

Analogous definitions can also be developed for severity density functions. 

When the upper and lower Matuszewska indices are finite for the tails of a distribution function one may state the 
following bounds in Proposition |2 . 2 7| 

Proposition 2.27. Given a severity distribution function F with a finite upper Matuszewska index yp < 00 then there 
exists constants C\ and xq such that the bound 



F(x) 



-7 



< C x - (2.35) 



F(y) ~ \y 

holds for all x > y > xq and j-p < 7 < 00 . Furthermore, if the lower Matuszewska index is finitely positive S -p > then 
there exists constants C2 and Xq such that 

gf\>cJ X -Y S (2.36) 
F{y) \yj 

holds for all x > y > xq and < 6 < 6-p. 

We now proceed with the process of utilizing these characterizations for the heavy-tailed severity models to obtain 
asymptotic bounds for the compound process tails and capital estimates. 



2.2. Single Risk Closed Form Compound Process Approximations of Asymptotic Tail Behaviour 

We begin by noting several important properties one can obtain when combining severity distributions from the above 
heavy-tailed families into LDA structures. We will focus purely on Poisson processes, though this can trivially be 
generalized to other frequency distributions of interest. We first consider the annual loss process for a fixed number of 
loss events N = n and state some properties of the partial sum with respect to assumptions on the tail behaviour of the 
severity model, see discussion in [ST], [SS] and [55] . 

Lemma 2.28 (Convolution Root Closure of Sub-exponential Distributions). Assume the partial sum Z n = ^27=i-^i 
is regularly varying with index p > with each Xi being i.i.d. with positive support. Then for all i G {1, . . . , n} the Xi 's 
are regularly varying also with index p and the following asymptotic equivalence as x — ^ 00 holds 

F z „(x) = Ft (Z n > x) ~ riPr {X x > x) , Vn > 1. (2.37) 
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This result can be restated analogously, for any n > 1, by showing that one has asymptotically as x — > oo, 

F*"(x) ~ nF(x) as x -> oo. (2.38) 

This means that given a sum of n independent random variables X\ , . . . , X n with common distribution F one has the 
following probabilistic interpretation for sub-exponential distributed random variables (often referred to as the 'big jump'), 

Pr [Xi + ■ ■ ■ X n > x] ~ Pr [max (X x , . . . , X n ) > x] , as x -> oo. (2.39) 

In addition, if one wishes to consider insurance mitigation in which each loss {A"i}™ =1 is mitigated by an insurance policy 
coverage to produce {ciXi}™ =1 for some Cj G [0,1], as for example under one of the insurance policy stuctures detailed 
in |25j . Then given the losses are from a severity model which is regularly varying and tail balanced, the result in 



Lemma 2.29 see (6T)l Appendix 3.26] can be interpreted as a generalization of the above result, in an OpRisk context, to 



incorporate simple insurance mitigations. 

Lemma 2.29 (Generalization to Insurance Mitigation). Consider an i.i.d. sequence of loss random variables Xi 
for i € {1, 2, . . . , n} with distribution function F which is regularly varying with index p > and satisfies the tail balance 
condition for some p + q = 1 in Definition \2.11\ Then for any real constants Cj € K and integer n > I one can show 

n 

Pr (cA + • ■ • + c n X n >x)~Pr ([Xi] > x) £ [p (c+) P + q (c+) P ] (2.40) 

i=i 

where c+ = V +Cj and c[ = V — Cj. 

Remark 2.30. ./Vb£e £/iat m general, except when n = 2, it is not tree that given a linear combination c\X\ + C2X2 + 
• • • + c n X n which is regularly varying with index p for an i.i.d. sequence of losses (Xi) , this does not imply that X\ is 
from a distribution which is regularly varying. 

If we now consider the compound process setting, in which the number of losses in the given year is treated as a random 
variable, then the following results can be developed for the right tail approximation as a function of the heavy-tailed 
assumptions of the severity model. 

For the class of heavy tailed sub-exponential LDA models we have that a probability distribution function F will belong 
to the sub-exponential class T if F F 1 i-e. it is max-sum-equivalent with itself and that the class J 7 is closed under 
convolutions. The implications of this for LDA models is clear when one observes that sub-exponential LDA models are 

compound process random sums comprised of an infinite mixture of convolved distributions, 

00 

G(x) = J2PnF n *(x), (2-41) 

n=0 

for a suitable series (e.g. convergent sequence satisfying Kolmogorov three series theorem). As an example, consider 

the case in which p n is defined by an LDA model constructed as a compound Poisson distribution, where each term will be 
given by p n = e~ A ^y. Then using the property of max-sum equivalence one can show the practically relevant asymptotic 
equivalence between the severity distribution F and the annual loss distribution G in which selecting F € T results in 
GeJ and _ 

lim G M = A. 

x^co F(x) 
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This asymptotic equivalence relationship between the severity distribution F and the annual loss distribution G, which is 
present for sub-exponential LDA models, greatly benefits the formulation of asymptotic approximations of tail functionals 
such as quantiles and tail expectations in such LDA models. It should also be noted that there also exists a special sub- 
family of such sub-exponential models which have the additional feature of being infinitely divisible in their severity 
models. The consequence of this additional feature is substantial as it often means that closed form representations of 
the distribution and density for the annual loss distributions can be obtained, see discussions in |25j and |16j . 

In general based on these properties we can obtain asymptotic approximations to the annual loss distribution tails which 
typically fall under one of the following classifications: 

• "First-Order" and "Second-Order" Single Loss Approximations: recently discussed in [35], [37], [ST] and references 
therein. 

• "Higher-Order" Single Loss Approximations: see discussions in [51 and recent summaries in [2jJ| and references 
therein. 

• Extreme Value Theory (EVT) Single Loss Approximations (Penultimate Approximations) : the EVT based asymp- 
totic estimators for linear normalized and power normalized extreme value domains of attraction were recently 
discussed in [r?T] . 

• Doubly Infinitely Divisible Tail Asymptotics given a-stable severity models discussed in jTH] and [55] 
2.3. First Order Single Risk Loss Process Asymptotics for Heavy-Tailed LDA Models 

Consider the compound process distribution and right tail distribution functions, given for annual loss Zjy = Yl^-i Xj, 

by 

OO 

F ZN (X) = Y,PnF* n {x) 

n=0 
oo 

71=1 

where it is assumed that the frequency probability mass function satisfies n P n ^ 00 ■ Then the following first order 

single risk loss processes tail asymptotic results apply when one considers a severity model with a regularly varying right 
tail. 

Theorem 2.31 (First Order Single Risk Asymptotic Tail Approximation). The following results for compound 
process tail behaviour hold: 

1. Assuming the distribution of Xi losses is in the subexponential severity family and an independent integer-valued 
random variable for the number of losses, N satisfies E [(1 + e)^] < oo, for an e > 0. Then the following asymptotic 
equivalence holds 

Vr(Z N > x) ~E[AT]Pr(Xi > x) . (2.42) 

2. Assuming the distribution of Xi is regulary varying with postive index p > and the mean number of losses is finite 
K[N] < oo and Pr (N > x) — o (Pr (X\ > x)). Then the following asymptotic equivalence holds 

Ft(Z n >x)~E[N]Ft(Xi>x). (2.43) 
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3. If one assumes the counting random varible N is regularly varying with index j3 > 0. If j3 = 1 then assume that 
E[N] < oo. Let the i.i.d. loss random variables (Xi) with finite mean E [Xi] < oo and tail probablity with asymptotic 
condition Pr (Xi > x) — o(Pr(N > x)), then assymptotically one has the equivalence for the compound processes 

Pr(Z N > x) ~ (E[Xx]fFr(N > x) . (2.44) 

4. Assume that Pr (N > x) ~ cPr (Xi > a:) /or some c > and Xi is regularly varying with an index p > 1 and w</i 
finite mean E [Xi] < oo. TTien one ftas i/ie asymptotic equivalence 

Pr (Zjv > x) - Pr (Xi > a:) (E[N] + c (E [X^f) . (2.45) 



The above results make intuitive sense, since in for example the heavy-tailed subexponential severity model case, the tail 
of the distribution of the sum and the tail of the maximum are asymptotically of the same order. Hence the sum of losses 
is governed by large individual losses, and therefore the distributional tail of the sum is mainly determined by the tail 
of the maximum loss. In addition in the case of the general sub-exponential models, we note that the results presented 
for the first order single loss tail approximation required that E[z ] < oo for some z > 1. This means that N must have 
finite moments of all orders. This clearly has implications on the properties of the severity distribution since we observed 
the result that 

lim F E?^ X > =E[iVl. (2.46) 
K-yoo F(x) 

To understand these implications we shall present a basic understanding how this first order asymptotic is derived. 



For these first order asymptotic behaviours of the right tail Fz N (x) one can consider obtaining an upper bound for the 
asymptotic ratio of the tail of the compound distribution and the severity distribution tail for each number of losses nej. 
This is given in Lemma 2.32| for the standard geometric Kesten bound and then in the more general bound in Theorem 



2.33 derived in [62l Theorem 7]. 



Lemma 2.32 (Kesten's Bound). If the severity distribution F is in the class of sub- exponential distributions then for 
each e > there exists a constant K — K(e) < oo such that for all n > 2 the following bound holds 

-^><K(l + er,x>0. (2.47) 
F(x) 



A generalized version of such an upper bound for the tail ratios for each number of losses n of F — ^ is given in Theorem 



Theorem 2.33. Consider a severity distribution F which is subexponential. Next define the sequence of constants 
i a n} n >o 9 iv en by 

F* n (x) 

a„ = sup^-Y. (2.48) 

x>0 F(X) 

Then a general bound on a n for each n which is only a function of F and F* 2 is obtained by using for any n = 1, 2, 3, . . . 

n— 1 n— 1 n— 1 

«„<E( i+c ^) n (i+^)+n( i+£ fc) ( 2 - 49 ) 

j = l k=j + l fe=l 
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with the empty product Hfc=i(l + £ fe) = 1 an ^ where the sequence T n , e n are defined to satisfy 



F* 2 (x) 

sup ^-Y - 1 < 1 + e„ (2.50) 

a;>T„ I (X) 



where e n can be selected arbitrarily small through selection of T n large and c n is given by 

c 5 Tn) 
n F(r n y 



(2.51) 



Then using one of these bounds on the tail ratios of the n-fold convolution of the severity model and the tail of the 
marginal severity model for each n one can extend to the compound process setting using the dominated convergence 
theorem. Given for example the standard Kesten bound one now applies the dominated convergence theorem given in 



Lemma 2.34 to the limit to obtain the result for the First Order Single Loss Approximation in (2.461 



Lemma 2.34 (Dominated Convergence Theorem). Consider a sequence of integrable functions {f n } on probability 
space (f2,£,/^) which satisfy the limit that 

lim f n (x) = f(x), almost everywhere — fi. (2.52) 

n— >oo 

In addition suppose that there exists an integrable function 

H>0: \f n (x)\ < H{x) Vn 

then if \f(x)\ < H(x) one has the limit result 



lim / f n (x)dfi(x) = / f(x)d»(x). (2.53) 

Therefore one utilise this bound and the dominated convergence theorem to interchange the order of the summation and 
the limit and then utilise the fact that for heavy-tailed sub-exponential severity models the condition that 

F* 2 (x) 

lim i^*± = 2 (2.54) 
characterizing this sub-class of distributions implies that 



lim— U= n (2.55) 

x-s-oo F(X) 



see [631 Lemma 1.3.4]. Hence one obtains for ths compound process ratio, 



lim hm f^ Pn ^M = f2np n =E[N], (2.56) 

x^oo Fix) X-HX)-*— ' Fix) , 



which is equivalent to the asymptotic equivalence statement that 



F Zn (x)~E[N]F(x). (2.57) 

Remark 2.35. In \62§ they weaken the condition of all moments of N being finite, based on the results presented i n \6$ - 
This relaxation involves considering less restrictive tail assumptions on the sub- exponential severity distribution F , in 
particular it is assumed instead that it is a function of O-regular variation F £ OR. Then two bounds are obtained for 
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the ratio S= ^ depending on whether the lower Matuszewska index (3(F) is in the interval (—00, —1] or (— 1, 0) then the 
corresponding bounds can be obtained where 

~F* E (x 



F(x) 

for some constants A and e > or 
for some constant A. 



< An l/3(F)l+1+£ , for F e OR and /3(F) < -1 (2.58) 



< An, for F E OR and /3(F) e (-1, 0) (2.59) 



A summary of results is provided in [621 Theorem 2] . 



2.4- Refinements and Second Order Single Risk Loss Process Asymptotics for Heavy- Tailed LDA Models 

The second order single loss approximation as discussed in [65) and derived in [66] takes the form given by the Theorem 



2.36 also see [57] [Proposition A3]. 



Theorem 2.36 (Second Order Single Risk Asymptotic Tail Approximation). The following results for compound 
process tial behaviour hold under a refined second order approximation: 

1. Assuming the severity distribution for losses Xi is zero at the origin (x — 0) and satisfies that both the tail distribution 
F and the density f are subexponential. Furthermore assume the the frequency distribution N ~ Fjy(n) is such that 
its probability generating function given by 



Pn(v) — E [v N ] = ]TPr(jV 



n)v 



n=Q 



is analytic at v = 1, then one has for finite mean severity models (E[X] < oo), that 



lim 

x— >oo 



F z (x) -E[N]F(x) 



E[X]E[(N- l)N] 



(2.60) 



(2.61) 



If the severity distribution has an infinite mean and the density satisfies f € RVLi/p—i for 1 < f3 < 00 then one has 

F z {x)-E[N]F(x) 



lim 



™ f(x)JZF(s)ds 



= c E[(N-l)N] 



(2.62) 



with c\ = 1 and cp = (1 — f3) 2 r(i-2//3) f or $ ^ C^' °°)- 
2. Assuming the losses Xi have a subexponential severity distribution function F in which E[X] < 00. Furthermore, 
assuming that the severity distribution admits admits a continous long-tailed severity density f(x) which is domi- 
nantly varying with an upper Matuszewska index given by a(f) < — 1. Futhermore, assuming that there is also an 
independent integer-valued random variable for the number of losses, N satisfying E [(1 + e) w ] < 00, for an e > 0. 
Then the second-order asymptotic approximation of the compound process tail probabilities, as x — > oo, are given by 



F z (x) = E [TV] F(x) + 2E 



Nl 



E [X] /(a;). 



(2.63) 



(iV-2)!2!_ 

3. Assuming the losses Xi have a severity distribution function F which has regularly varying tail with index p g (0, 1] 
such that F(x) £ RV- P and a severity density f(x) which is regularly varying, then the second order approximation 
for F z{x) is given by 



Fz(x) 



E [N] F(x) 



(2-p)r(2- P ) 
(p-i)r(3-2 P ) 



E 



Nl 



E [N] F(x) + 2E 



m 

(iV-2)!2! 



(iV— 2)121 

f(x)f*F(y)dy, 



f(x)J* F(y)dy, 0<p<l 
p = l. 



(2.64) 
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Remark 2.37. Note many distributions will satisfy the conditions in Theorem \2.36\ such as Log-Normal, Weibull, Benk- 
tander Type I and Type II and numerous others. 

Remark 2.38. These first and second order repesentations are remarkable as they each state that for the annual loss 
distribution under an OpRisk LDA model for general sub- exponential family severity models, when considered at high 
confidence levels, the resulting quantiles of the annual loss distribution become only dependent on the tail of the severity 
distribution and not on the body. Therefore when making such an asympototic approximation it is convenient that one 
only requies a quantification of the mean of the frequency distribution. Consequently, over-dispersion as captured by 
Negative-Binomial processes will not affect the high confidence level quantiles of the annual loss distribution. 



3. The Journey Completes: Going from Compound Process Tail Asymptotics to Capital Approximations 

In this section we will consider asymptotic approximations of key risk management quantities known as risk measures 
which are used in the allocation of capital and reserving in all financial institutions and stipulated as standards under 
regulatory accords in both Basel II/III and Solvency II. Examples of such tail functionals include the calculation of 
Value-at-Risk (VaR), Expected Shortfall (ES) and Spectral Risk Measures as detailed below in both their definitions and 
the resulting simple first and second order asymptotic approximations one may consider. 



Proposition 3.1 (First-Order Approximate Risk Measures). These asymptotic expansions allow one to obtain es- 
timates of common risk measures, see I68\j and \bty , such as: 

1. Value-at-Risk (VaR): for a level a G (0,1), given by the quantile of the annual loss distribution, 

VaR z (a) = F|"(a) = inf {z el:f z (z)> a} 

<- ( , «_ ( l-a\ (3.1) 



E[N] 1 w 7 V E i N ] J ' 

where F*~(-) is the generalized inverse, see J7Q| /. 
2. Expected Shortfall (ES): for a level a £ (0, 1), the expected shortfall (ES) is given by the tail expectation of the 
annual loss distribution according to 



1 f 1 

ESz(a) = E \Z\Z > VaR z (a)} = / VaR z (s)ds 

1 - a J a 



( 1 - W^T ) ~ Z~^ VaR z («) . 



(3.2) 



a - 1 V / a - 1 

see J7i| /; and 

3. Spectral Risk Measure (SRM): for a weight function cj> : [0, 1] n- R given by 



SRM z (0) = / <f>(s)VaR z (s)ds 
Jo 



(3.3) 



with Vt E (l,oo) a function </>i(l — 1/t) < Kt e for some K > and e > where 

K{a,4n)= s 1 ' l3 ^4 )1 {l-l/s)ds. (3.4) 
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For the Spectral Risk Measure (SRM), in [75] it is shown that if an individual has a Constant Absolute Risk Aversion 
(CARA) utility function with coefficient of absolute risk aversion £ then the SPR should be given as 



SRM 4>k {k) = / Ms)VaR(s), 



where 



with 



cf> K (s) = (1 - f 1 - ^) /[k,i](s) 



0i («0 



£e 



1 - e- 



Note that if one take 4>i(t) = lVt G [0, 1] then the Spectral Risk Measure resumes to the Expected Shortfall. 



Proposition 3.2 (Second-Order Approximation of the VaR). Assume that severity distribution F has finite mean, 
and the hazard rate h(x) — ^Lp^lx) * s °f re 9 u ^ ar variation h G RV—p for j3 > 0, then as a — ¥ 1 one has for the inverse of 
the annual loss distribution the result (see [26] and 1671), 



VaRz(a) = F z \a) = F~ I 1 



1 - a 
E[N] 



{l + c m (F- 1 (a))+o(g 1 (F _1 (a)))} 



where a = 1 — (1 — a)/E[JV] and 



0i(z) 



l-F[x) ' 

F( s )dsf(x) 
1-F(x) ■ 



if ELY] < oo, 
ifE[X] = oo. 



~ _( m] %r m , ifE[A]<oo, 
^-| s ^v-i M) ifE[iV]=0O) 



where we define 



c /3 



= 1, 

(i-/3)^4|, if pe(i,oo). 



(3.5) 



(3.6) 



Proposition 3.3 (Second-Order Approximation of the SRM). For a given tail function F (t) G RV_ a define Up it) 
(1/(1 — F))^~(i) G i?Vi/ a . iVow assume that for some p < 0, 3A(t) G i?V" p suc/j i/ia£ 

hm 



Then, 



as k — ¥ 1. where 



SRM^k) 



t_, +00 ^) ^ [/ F ( t ) 



/C(a,0i)+A(^ffl^X(a,0i,p) 



1 



1 - a 
E[N] J ' 



M 



(a,&,p) = l -£ °t 1 / a - 2 {tP-l)4> 1 (l - ~) dt 



and JC(a,<pi) as defined in 3.4 see details in \72^ . 



Corollary 3.4 (Second-Order Approximation of the ES). Let 4>i(t) = IVi G [0, 1]. Then, when K-)liue have the 
following Second-Order approximation of ES(n), see \72^: 



ES{k) 



a 



a 



ELY] 

a — 1 (a — ap — l)(a — 1) \1 — k 



F^ 1 



I- a 
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The properties of such asymptotic single loss approximation estimates are still an active subject of study with regard 
to aspects such as explicit approximation errors, unbiased quantile function estimation, asymptotic rates of convergence, 
sensitivity to parameter estimation and model misspecification. In the following examples, we illustrate the properties of 
these first and second order approximations presented above for two popular heavy tailed LDA risk models. 



4. Examples 

In this section we consider two popular examples of LDA models that are regularly considered in OpRisk settings. These 
are the Poisson-Log Normal and the Poisson-Inverse-Gaussian LDA models. We detail the asymptotic approximations as 
a function of the quantile level a for the VaR in both models and the tail function in the Poisson-Inverse-Gaussian, where 
the tail is known in closed form. In particular we consider a range of different parameter settings affecting the heavy-tailed 
features of the severity model and assess the accuracy of the asymptotic approximations, versus an exhaustive Monte 
Carlo simulation approximation of the true solutions. This provides interesting information of relevance to assessing the 
rates of convergence of these asymptotic results, which are currently unknown in the literature. In addition, we study the 
sensitivity to the parameter estimation in the accuracy of the asymptotic risk measure approximations. 



Example 4.1 (Single Risk LDA Poisson-Log-Normal Family). Consider the heavy tailed severity model, selected 
to model the sequence of i.i.d. losses in each year t, denoted {Xi(t)} i=1 . N , and chosen to be a Log-Normal distribution 
Xi ~ LN(/j,, a) where the two parameters in this model correspond to parameterizing the shape of the distribution for the 
severity a and the log-scale of the distribution fj,. The survival and quantile functions of the severity are given by 



1 



e 2<t2 j x > 0; fi £ 



g > 



F{x;n,a) = l-F(x) 



1 1 . 
= 2 + 2 erf 



1 



In a; — fi 



exp 



(log(u) 



du 



Q{p) = exp U + cr$~ 



v 2a 2 
(p)) , < p < 1. 



, x > 0; \i ela > 



Therefore the closed form SLA for the VaR risk measure at level a would be pre sente d in t his c ase under first and second 



order approximations for the annual loss Z — Yln=i -^i according to Equations i4-l) and (4-2), respectively 

'1 - a" " 



VaR a [Z] = exp 
VaR a \Z\ = F- 1 



A* 



1 - 



1 — a 



A 



A 



{l + c^OF- 1 ^))}- 1 ) 



(4.1) 
(4.2) 



where a = 1 — (1 — ot)/\, g\{x) 



m 

l-F(x) 



and 



C/J 



e „+- 2 /2 A . 



We will now compare this first order and second order asymptotic result to the crude Monte Carlo approach (for which 
one can generate uncertainty measures such as confidence intervals in the point estimator). To complete this example, 
we illustrate the basic Monte Carlo solution for the VaR for a range of quantile levels of the annual loss distribution, 
we display these along with the measured confidence intervals in the point estimators and we compare these to the first 
order SLA asymptotic result. The quantiles a £ {0.70,0.75,0.80,0.85,0.9,0.95,0.99,0.995,0.9995} are considered where 
the 99.5% and 99.95% quantile levels do in fact correspond to regulatory standards of reporting in Basel II/III. 
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Figure 2: VaR approximation for the Poisson-Log-Normal example. 
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Example 4.2 (Single Risk LDA Poisson-Inverse-Gaussian Family). Consider the heavy tailed severity model, se- 
lected to model the sequence of i.i.d. losses in each year t, denoted {Xi(t)} i=1 . Nt , and chosen to be an Inverse- Gaussian 
distribution Xi ~ IGaussQl, A) where the two parameters in this model correspond to parameterizing the scale of the 
distribution for the severity Jl and the scale of the distribution X. The survival and quantile functions of the severity are 
given by 

,(^> = (A)'V^-%^} 

F(a:;/Z,A) = * 




Therefore the closed form SLA for the VaR risk measure at level a would be pre sente d in t his c ase under first and second 



order approximations for the annual loss Z — X^=i %i according to Equations (4-3) and \4-4h respectively 



VaR a [Z] = F- 1 (4-3) 

VaR Q [Z] = F- 1 (l - I^l + c^OF- 1 ^))}- 1 ) (4.4) 

where a = 1 — (1 — ot)/\, g\(x) = i!}f\ x ) an dcp = J1X. 

Since the Inverse-Gaussian family is closed under convolution, ie, 

n 

Xi ~ IGauss(Jl, A) => S n = Xi ~ IGauss(njl, n 2 X), 

i=l 

we can calculate the distribution of the compound process analiticaly (see the comparison with the approximations on 
Figure The drawback of this family is that there is no closed form for the inverse cdf } which obliges us to resort to a 
numerical procedure for obtaining the quantiles, fortunately this is efficient and accurate for this class of models. For this 
model we also present first and second order approximations for the VaR and SRM for different choices of parameters on 
Figures [^J [3| 

One can see that even in these relatively simple examples, depending on the values of the parameters in the LDA risk 
model, the asymptotic VaR approximation may or may not be accurate at quantile levels of interest to risk management. 
Therefore, even small amounts of parameter uncertainty in the LDA model estimation may manifest in significanlty 
different accuracies in the SLA capital estimates. 

In the Poisson-Log-Normal Example, we can see on Figure [2] that the volatility of the severities, a, play a very important 
role on the accuracy of the approximations. It is also important to note that although in all the cases the second 
order approximations does not perform very well for quantiles bellow the 95-th percentile above this threshold it usually 
performs better than the first order. 

For the Poisson-Inverse-Gaussian case the greatest sensitivity is clearly on the rate A (see, for example, the bottom-left 
plot on Figure In difference to the Log-Normal example, the second order approximation seems to perform always 
better than the first order, but none of them are sufficiently close to any of the "true" (Monte Carlo) VaR. 

These results therefore serve to illustrate the importance of understanding and developing further studies on the rate of 
convergence of these asymptotic single loss approximations. This will help to guide in the understanding of when they 
can be reliably utilised in practice. 
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Figure 3: Tail function approximation for the Poisson-Inverse-Gaussian example. 
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Figure 4: VaR approximation for the 
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Figure 5: SRM approximation for the Poisson-Inverse-Gaussian example. 
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5. Conclusions 



This work presented an extensive journey through some of the fundamental concepts necessary to the understanding 
of approximation of OpRisk measures under the Loss Distributional Approach (LDA). We discussed some of the most 
important classes of severities distributions classified by their right tail properties (such as regularly varying and long 
tailed) and their interrelations, which we think can help practitioners in the choice of the apropriate heavy tailed model. 
Due to the complex nature of these models, assymptotic results are often required for the calculation of risk measures 
(VaR and Expected Shortfall, for example) and a review of how these so-called First and Second-Order approximations 
are obtained was provided alogn with a summary of several key results. Finally, for two popular LDA models First and 
Second-Order approximations can give precise results, but are highly sensitive to the parameters. 
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